Rapid Prototyping of the Data-Driven Chip-Multiprocessor (d2-CMP) Using FPGAs

نویسندگان

  • Konstantinos Tatas
  • Costas Kyriacou
  • Paraskevas Evripidou
  • Pedro Trancoso
  • Stephan Wong
چکیده

This paper presents the FPGA implementation of the prototype for the Data-Driven Chip-Multiprocessor (D2-CMP). In particular, we study the implementation of a Thread Synchronization Unit (TSU) on FPGA, a hardware unit that enables thread execution using dataflow-like scheduling policy on a chip multiprocessor. Threads are scheduled for execution based on data availability, i.e., a thread is scheduled for execution only if its input data is available. This model of execution is called the Data-Driven Multithreading (DDM) model of execution. The DDM model has been evaluated using an execution driven simulator. To validate the simulation results, a 2-node DDM chip multiprocessor has been implemented on a Xilinx Virtex-II Pro FPGA with two PowerPC processors hardwired on the FPGA. Measurements on the hardware prototype show that the TSU can be implemented with a moderate hardware budget. The 2-node multiprocessor has been implemented with less than half of the reconfigurable hardware available on the Xilinx Virtex-II Pro FPGA (45% slices), which corresponds to an ASIC equivalent gate count of 1.9 million gates. Measurements on the prototype showed that the delays incurred by the operation of the TSU can be tolerated.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

FPGA Prototyping and Design Evaluation of a NoC-Based MPSoC

Chip communication architectures become an important element that is critical to control when designing a complex MultiProcessor System-on-Chip (MPSoC). This led to the emergence of new interconnection architectures, like Network-on-Chip (NoC). NoCs have been proven to be a promising solution to the concerns of MPSoCs in terms of data parallelism. Field-Programmable Gate Arrays (FPGA) has some ...

متن کامل

DDM-CMP: Data-Driven Multithreading on a Chip Multiprocessor

Potential increases for larger number of onchip processors. DDM-CMP is likely to have a large number of small embedded microprocessors Scheduling is often in data driven model of execution Thermal profiling information can be included during re-compilation We have developed: TSIC: Thermal Scheduling Simulator for Chip Multiprocessors to study the several alternatives The Scheduling algorithm de...

متن کامل

Rapid prototyping of an adaptive noise canceler using GRAPE

In this paper we describe the rapid prototyping of an adaptive noise canceler using the Graphical RApid Prototyping Environment (GRAPE), developed at our laboratory. It is an environment which facilitates the realtime emulation and implementation of synchronous DSP applications on heterogeneous target platforms consisting of DSPs and FPGAs. Our work demonstrates the feasibility of real-time pro...

متن کامل

OpenMP Implementation and Performance on Embedded Renesas M32R Chip Multiprocessor

CMP (Chip Multiprocessor) is a promising processor architecture, not only for high performance but also for reducing power and energy consumption in embedded applications. We have implemented an OpenMP compiler for an embedded Renesas M32R chip multiprocessor as a parallel programming environment. In this paper, we report the preliminary performance of OpenMP benchmarks, including scientific an...

متن کامل

RPM: A Rapid Prototyping Engine for Multiprocessor Systems

In multiprocessor systems, processing nodes contain a processor, some cache and a share of the system memory, and are connected through a scalable interconnect. The system memory partitions may be shared (shared-memory systems) or disjoint (messagepassing systems). Within each class of systems many architectural variations are possible. Fair comparisons among systems are difficult because of th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Parallel Processing Letters

دوره 18  شماره 

صفحات  -

تاریخ انتشار 2008